Skip to content

Fill DequantizeLinear CUDA opset gap (21 → 25)#27745

Closed
Copilot wants to merge 4 commits intomainfrom
copilot/update-dequantizelinear-op
Closed

Fill DequantizeLinear CUDA opset gap (21 → 25)#27745
Copilot wants to merge 4 commits intomainfrom
copilot/update-dequantizelinear-op

Conversation

Copy link
Copy Markdown
Contributor

Copilot AI commented Mar 18, 2026

Description

Caps the existing CUDA DequantizeLinear non-versioned opset 21 registration to versioned 21–24, and adds a new non-versioned opset 25 registration with identical type support.

quantize_linear.cc

  • REGISTER_DQ_KERNEL_TWO_TYPED_21REGISTER_DQ_KERNEL_TWO_TYPED_21_24 (uses ONNX_OPERATOR_VERSIONED_TWO_TYPED_KERNEL_EX)
  • New REGISTER_DQ_KERNEL_TWO_TYPED_25 (uses ONNX_OPERATOR_TWO_TYPED_KERNEL_EX)

cuda_execution_provider.cc

  • Forward declarations: 12 entries changed from ONNX_OPERATOR_TWO_TYPED_KERNEL_CLASS_NAME(..., 21, ...) to ONNX_OPERATOR_VERSIONED_TWO_TYPED_KERNEL_CLASS_NAME(..., 21, 24, ...)
  • 12 new forward declarations + BuildKernelCreateInfo entries added for opset 25

Same type matrix across both ranges: {uint8_t, int8_t, UInt4x2, Int4x2} × {float, MLFloat16} + Float8 variants.

Motivation and Context

CUDA DequantizeLinear was registered only up to opset 21; ONNX spec defines it through opset 25. Models exported at opset ≥22 would fail kernel lookup on CUDA EP.

Follows the standard opset gap-filling pattern (see e.g. #27728).

Related: #27729

…, add opset 25

Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>
@tianleiwu
Copy link
Copy Markdown
Contributor

/azp run Windows GPU Doc Gen CI Pipeline

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

Copilot AI and others added 2 commits March 18, 2026 16:37
…nel registration

Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>
@tianleiwu tianleiwu marked this pull request as ready for review March 18, 2026 23:48
Copy link
Copy Markdown
Contributor

@github-actions github-actions Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can commit the suggested changes from lintrunner.

Comment thread onnxruntime/core/providers/cuda/tensor/quantize_linear.cc Outdated
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@tianleiwu tianleiwu closed this Apr 12, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants